ShareCaptioner is an open-source image captioning model. It is built on an improved InternLM-XComposer-7B base model and fine-tuned on the ShareGPT4V dataset, whose captions were produced with the assistance of GPT4-Vision. Its purpose is to generate detailed, high-quality descriptions of images.
Image-to-Text
Transformers